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Introduction 

At  each  instant  of  time  a  depletable  resource  can  be  drawn  from 
any  one  of  a  number  of  pools.   The  cost  of  removing  an  extra  unit  from  a 
pool  depends  on  how  much  has  already  been  taken  out  of  it.   What  policy 
supplies  a  fixed  flow  of  the  resource  at  minimum  present  discounted  cost? 

It  is  easy  to  characterize  the  optimal  rule  in  a  classical  environ- 
ment where  every  pool  has  nondecreasing  extraction  costs.   At  any  time 
simply  draw  the  required  amount  from  the  source  with  lowest  marginal  cost. 
But  what  happens  if,  as  in  the  real  world,  incremental  extraction  costs 
decline  in  some  initial  range?   In  the  general  case,  a  marginalist  policy 
of  exploiting  the  least  cost  source  would  be  sub-optimal. 

The  present  paper  shows  that  a  natural  generalization  of  the 
marginalist  rule  is  optimal  when  resource  pools  have  arbitrary  extraction 
costs.   A  key  solution  concept  turns  out  to  be  the  "equivalent  stationary 
cost"  of  exploiting  the  resource.   Properties  of  an  optimal  policy  are 
derived  and  interpreted. 

The  Model 

The  present  paper  is  primarily  an  exercise  in  trying  to  characterize, 
at  a  high  level  of  abstraction,  the  basic  form  of  an  optimal  policy  for  ex- 
ploiting natural  resources.   This  goal  can  be  most  conveniently  served  by 
treating  the  underlying  prototype  problem  in  a  discrete  formulation.— 

A  "generalized  resource"  is  extracted  from  "generalized  resource 
pools",  indexed  by  i.   The  system  is  in  state  {m.  },  where  each  m.  is  a 


—  A  continuous  version  yields  completely  analogous  results,  but  the  mathematical 
analysis  is  intrinsically  more  technical.   Actually,  the  form  of  an  optimal 
policy  in  the  continuous  case  is  most  easily  proved  by  first  treating  the 
corresponding  discrete  or  integer  mode  and  then  carefully  taking  the  limit  as 
the  step  size  approaches  zero. 
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non-negative  integer,  if  amount  m.  of  the  resource  has  already  been  extracted 
from  pool  i.   A  function 

gives  the  cost  of  extracting  one  more  unit  of  the  resource  from  pool  i  when 

m  units  have  already  been  taken  from  that  source.   If  no  more  than  amount 

m  can  be  gotten  from  pool  i,  we  symbolically  write  F.(m)  =  =°. 

Let  V.  be  an  indicator  variable  which  takes  on  value  unity  if 
1 

pool  i  is  being  exploited  and  zero  otherwise.   A  transition  can  be  made 
from  state 


{m.} 

1 


to  state 


{m.+v.} 

X   X 


where 


v.  =  0  or  1 

1 


E  V.  =  1, 

X 

X 


at  a  cost  of 


E  V.  F.(m.) 

XXX 

X 


There  is  a  constant  demand  for  the  resource  which  must  be  satisfied. 
Without  loss  of  generality,  time  is  measured  so  that  demand  equals  one.   Let  a 
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2/ 
be  the  appropriate  one  period  discount  factor.   The  basic  problem  —  is  to 

determine  control  variables  {v.(t)}  which  will 


minimize 


subject  to 


Z     a^    Z   v.(t)  F.(m.(t))  (1) 

.111 
t=0    i 


m.(t+l)  =  m^(t)  +  v^(t)  (2) 


2  v^(t)  =  1  (3) 

i 


v.(t)  =  0  or  1  .  (4) 


There  are  many  ways  in  which  the  actual  economics  of  extraction 

3/ 
is  more  complicated  than  the  simple  model  presented  here.—   Nevertheless, 

the  above  formulation  might  be  defended  as  a  rough  approximation  to  the 

general  issue  of  finding  the  best  pattern  for  developing  a  resource  from 

alternative  sources  or  pools.   The  fact  that  it  is  possible  to  sharply 

characterize  an  optimal  solution  makes  problem  (l)-(4)  a  natural  preliminary 

to  any  more  general  analysis.   And  it  may  even  be  a  reasonable  description 

of  some  situations. 

Depending  on  the  context,  the  various  pools  might  be  mines,  pits, 

wells,  deposits,  fields,  regions,  or  even  resource-based  technologies. 


2/ 

—  Implicitly  we  are  assuming  that  (l)-(4)  is  well  formulated.   In  fact  it 

is  not  difficult  to  specify  sufficient  conditions  for  guaranteeing  the 
existence  of  a  solution. 

3/ 

—  Just  to  mention  a  few:   uncertainty  may  be  present;  demands  and  discount 

factors  could  vary;  extraction  costs  might  depend  on  current  outflow  as  well 
as  on  how  much  has  been  cumulatively  taken  from  a  pool.   Equation  (4)  could 
be  replaced  by  the  seemingly  weaker  "mixing"  condition  0  <  v.(t)  <  1,  but 

only  at  the  expense  of  making  the  analysis  more  intricate  without  otherwise 
altering  results.   Note  that  when  time  units  are  sufficiently  small  any 
fractional  extraction  policy  can  be  approximated  to  an  arbitrary  degree  of 
accuracy  by  a  discrete  policy  of  the  form  (4)  (which  rapidly  switches  back 
and  forth  among  the  pools  being  "mixed") . 


_4- 

Actually,  with  a  judicious  interpretation  of  "resource  pool",  formulation 
(l)-(4)  is  sufficiently  general  to  include  as  special  cases  many  of  the 
standard  deterministic  operations-research  models  for  such  problems  as  equip- 
ment selection  and  replacement,  inventory  and  production  scheduling,  or 

.,        .   4/ 
capacity  expansion.— 

Non-Decreasing  Costs:   A  Special  Case 

Consider  now  the  classical  assumption  of  constant  or  increasing 
marginal  cost.   In  sjnnbols, 

F.(m)  <  F.(m')  (5) 

for  all  i,m,m'  such  that 

m  <  m'  . 

When  restriction  (5)  is  imposed,  it  is  easy  to  characterize  an 
optimal  solution  to  problem  (l)-(4).   Always  extract  the  next  unit  from 
the  cheapest  source  currently  available.   At  time  t,  fix 


v.^(t)  =  1 


for  any  integer  i*  satisfying 


F,*(ra_(t))  =  min  F  (m.  (t))  .  (6) 

1"   1''        .    1   1 

1 


4/ 

—  In  such  situations  there  are  typically  several  classes  of  pools,  each  of 

which  contains  an  infinite  number  of  identical  members.   For  equipment 

problems  a  "resource  pool"  is  a  certain  piece  of  equipment  and  the  "amount 

extracted"  is  the  length  of  time  it  has  been  in  service.   With  scheduling, 

inventory,  and  expansion  problems,  a  pool  is  a  certain  strategy-schedule 

(like  ordering  inventory  or  building  capacity)  which  goes  up  to  some  expiration 

or  regeneration  point  (after  which  costs  are  interpreted  as  being  infinite) ; 

the  "amount  extracted"  is  the  length  of  time  the  given  strategy-schedule  has 

been  carried  out. 


i 
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That  such  a  policy  is  optimal  follows  directly  from  applying  the  standard 
marginalist  argument  to  a  situation  with  convex  cumulative  cost  functions 
and  positive  time  discounting. 

It  is  important  to  realize  that  the  marginalist  selection  rule  (6) 
does  not  give  correct  answers  in  a  non-convex  environment  where  (5)  fails 
to  hold.   If  extraction  costs  can  decline,  an  optimal  policy  may  require 
exploiting  a  source  having  high  initial  but  low  eventual  costs.   There  is 
no  way  that  a  completely  myopic  algorithm  which  looks  only  at  incremental 
costs  will  pick  out  this  kind  of  policy.   The  correct  solution  concept,  if 
there  is  one  for  a  problem  seemingly  as  complex  as  (l)-(4),  would  have  to 
somehow  look  ahead  at  the  entire  pattern  of  future  costs. 

Now  we  can  pose  the  following  basic  question.   Is  there  any 
simple  abstraction  or  generalization  of  the  marginalist  principle  which 
works  when  no  structure  whatsoever  is  placed  on  the  cost  coefficients 
{F.(a)}?   The  answer  turns  out  to  be  yes. 

The  algorithm  which  is  developed  in  this  paper  for  dealing  with 
an  arbitrary  cost  structure  may  be  of  some  interest.   Situations  with  a 
range  of  decreasing  marginal  cost  are  frequently  the  rule  rather  than  the 
exception.   Substantial  set-up  costs  in  overhead,  research,  development,  or 
the  like  must  usually  be  incurred  just  to  open  up  a  resource  pool.   Then 
there  is  the  pervasive  feature  of  learning  by  doing  or  cumulative  technological 
progress,  which  is  likely  to  lower  costs  over  at  least  the  beginning  stages 
of  exploiting  a  new  source.   It  is  true  that  costs  of  extracting  from  a 
given  pool  will  eventually  rise  if  and  when  resource  deposits  start  to  give 
out.—   But  the  evidence  suggests  that  incremental  costs  are  not  universally 
non-decreasing  over  their  entire  range. 


—  Even  then,  fresh  investments  in  secondary  and  tertiary  recovery  methods 
may  cut  subsequent  operating  costs,  at  least  temporarily. 
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Solving  the  General  Case 

If  amount  m  has  already  been  drawn  from  pool  i,  the  equivalent 
stationary  cost  of  taking  the  next  n  units  in  a  row  from  it  is  the  weighted 
average: 

n-1 
E   F . (m+j )  a^ 

4'^(m)  -=  ^^ .  (7) 

1      n-i 

E   aJ 

j=0 

Let  n  be  the  time  horizon  which  minimizes  (7): 

"i^im)    =    min    ¥?(m)  .  (8) 

n=l,2,"- 

We  permit  the  case  n  =  <».   This  could  occur,  for  example,  when  costs  are 
everywhere  non-increasing. 

The  implicit  cost  of  pool  i  (when  m  units  have  been  extracted 
from  it)  is  defined  as  its  smallest  equivalent  stationary  cost 

$.(m)  =  'i'^Cm)  .  (9) 

11 

The  following  decision  rule  completely  characterizes  an  optimal 
policy  in  the  general  case. 

Always  extract  the  next  resource  unit  from  the  pool  with  lowest 
implicit  cost.   Symbolically,  in  an  optimal  policy,  at  each  t 


v.^(t)  =  1  (10) 


for  any  integer  i*  satisfying 


?..(m..(t))  =min$.(m  (t))  .  (11) 

i'^   I''        .    1   1 

1 


1 

i 
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Converting  arbitrary  cost  streams  to  stationary  equivalents  for 
the  purpose  of  finding  the  cheapest  alternative  is  an  old  economist's 
trick.   The  optimality  of  decision  rule  (10) , (11)  can  in  a  sense  be  interpreted 
as  justifying  this  heuristic  procedure. 

Note  that  the  implicit  cost  of  a  pool  reduces  to  its  marginal 
cost  for  the  special  case  of  non-decreasing  costs.   This  is  because  when  (5) 
holds,  the  horizon  n=l  is  always  a  solution  of  (8);  and  then  (9)  becomes 

$  .  (m)  =  F .  (m)  . 

So  the  decision  rule  (11)  is  indeed  a  generalization  of  the  marginalist 
principle  (6). 

It  is  not  necessary  to  recalculate  implicit  costs  in  each  time 
period.   If  a  pool  is  not  used,  its  implicit  cost  does  not  change.   If  it 
is  used,  its  implicit  cost  must  have  been  lower,  or  at  least  no  higher,  than 
the  implicit  cost  of  any  other  pool.   But  then  it  will  remain  the  lowest 
implicit  cost  pool  (and  hence  will  continue  to  be  exploited)  for  at  least 
a  number  of  periods  equal  to  the  equivalent  stationary  cost  minimizing 
horizon  n  of  (8) . 

This  last  conclusion  can  be  derived  as  follows.   Split  the  time 
Interval  [0,n)  into  two  sub- intervals  [0,h)  and  [h,n)  for  h  any  integer 
between  1  and  n-1.   The  equivalent  stationary  cost  of  extracting  over  the 
interval  [0,n)  is  a  weighted  average  of  the  equivalent  stationary  costs  over 
its  component  sub-intervals: 


4''^(m)  =/-i=^  )l'^(m)  +  fl  - 
1-a  / 


l-g  i  \   „,n-h 


1-a^J 


r.      (m+h)  .    (12) 


We  know  from  (8)  that 
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"V^im)    <    ^i^im)    . 


It  follows  that 


Y?  '^(m+h)  <  *^(m) 


Using  (9) ,  the  above  expression  becomes 


I-"  ^(m+h)  <  $.(m)  .  (13) 


1        -   1 


But 


$.(m+h)  <  'F^  ^(m+h)  . 
1      -   1 

Thus, 

$ . (m+h)  <  $ . (m) 
X       -   1 

for  h  =  l,2,*'«,n-l,  the  result  to  be  proved. 

To  illustrate  the  typical  form  of  an  optimal  policy  for  exploiting 
depletable  natural  resources,  consider  the  following  simple  example.   Suppose 
there  are  but  three  resource  pools.   Each  pool  has  an  initial  range  of  de- 
creasing costs,  followed  by  a  final  section  of  increasing  costs.   Let  the 
pools  be  ordered  so  that  the  first  has  lower  implicit  cost  than  the  second, 
which  in  its  turn  is  lower  than  the  third.   The  optimal  strategy  will  be  to 
initially  exploit  the  pool  with  lowest  implicit  cost,  pool  number  one. 
This  will  be  done  until  the  marginal  cost  of  extracting  one  more  unit  (in 
the  increasing  cost  range)  becomes  greater  than  the  implicit  cost  of  the 
second  pool.   At  that  time  pool  two  will  start  being  exploited,  and  it  will 
be  the  exclusive  source  until  its  marginal  cost  in  the  increasing  cost 
range  becomes  greater  than  the  marginal  cost  of  pool  one.   Then  pool  one 
or  two  will  alternately  be  exploited,  depending  on  which  is  currently 
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the  cheaper  source  at  the  margin  (in  a  continuous  formulation,  both  would 
simultaneously  be  tapped).   That  is,  if  pool  one  has  lower  incremental 
cost  than  two,  it  will  be  exploited  until  its  incremental  cost  exceeds 
two's,  then  two  will  be  exploited  until  its  incremental  cost  exceeds  one's, 
etc.   This  rising  marginal  cost  phase  ends  when  the  implicit  cost  of  the 
third  pool  is  found  to  be  lower  than  the  incremental  costs  of  the  first 
two  pools.   Then  the  third  pool  alone  is  tapped,  up  to  the  point  where  its 
incremental  cost  in  the  increasing  cost  range  becomes  greater  than  the 
incremental  cost  of  the  first  or  the  second  pool.   From  then  on  all  three 
pools  are  alternately  exploited,  that  pool  being  tapped  at  any  given  time 
which  has  lowest  marginal  cost. 

Note  that  in  a  situation  where  all  pools  are  the  same  and  there 
is  an  unlimited  collection  of  them,  the  optimal  policy  will  be  cyclic  or 
recursive.   The  same  conclusion  holds  if  there  are  several  classes  of 
pools,  each  class  containing  an  Infinite  number  of  identical  pools  (because 
in  an  optimal  strategy  only  pools  from  one  class  will  be  tapped).   This  is 
why  so  many  of  the  standard  operations  research  models  with  renewable 
options  —  end  up  having  a  repetitive  solution  which  may  be  universally 
characterized  as  follows.   At  each  decision  node,  choose  the  strategy 
element  with  lowest  equivalent  stationary  cost. 

The  Basic  Monotonicity  Property 

The  correctness  of  decision  rule  (10) , (11)  derives  ultimately  from 
an  important  monotonicity  property. 


ft/ 

—  For  example,  the  simple  deterministic  models  of:   equipment  selection 

and  replacement,  inventory  and  production  scheduling,  or  capacity  expansion. 
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Pick  any  time  t.   Suppose  that  pool  i  is  being  exploited  throughout 
the  interval  [t,t+k)  but  not  in  period  t+k,  for  k  some  positive  integer.— 
Then  pick  any  time  t'  >  t+k.   Suppose  that  pool  i'  is  exploited  throughout 
the  interval  [t'-k',t')  but  not  in  period  t'-k'-l,  for  some  positive  integer 
k'.   (Naturally  t+k  <  t'-k'.) 

The  basic  monotonicity  property  is  that  with  an  optimal  policy 
the  equivalant  stationary  cost  of  exploiting  pool  i  over  the  interval  [t,t+k) 
must  be  no  greater  than  the  equivalent  stationary  cost  of  exploiting  i'  over 
[t'-k',t').   In  symbols, 

l-^Cm.Ct))  <  >fj!(m.,(t'-k'))  .  (14) 

When  t+k  =  t'-k',  this  fundamental  lemma  can  be  proved  by  a  simple 
variational  argument  which  reverses  the  order  of  pool  exploitation.  Suppose 
the  original  exploitation  pattern  looks  like  this: 


pool  i' 


time 


Figure  1 


—  If  k=°°,  the  monotonicity  property  is  empty.   But  an  important  corollary, 
to  be  explained  later,  still  holds. 
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If  the  equivalent  stationary  cost  of  exploiting  pool  i  over  the  interval 
[t,t+k)  is  greater  than  pool  i'  over  [t'-k',t'),  it  will  be  cheaper  to  delay 
incurring  the  higher  cost  by  changing  the  exploitation  pattern  to: 


pool  i'        pool  i 
x_ 


■^Z     ^^- ^A 

1 1 ^  time 

t+k'  t' 


Figure  2 

More  formally,  when  t+k  =  t'-k'  the  present  discounted  cost  over 
the  interval  [t,t')  of  the  allegedly  optimal  policy  which  exploits  pool  i 
in  the  sub-interval  [t,t+k)  and  pool  i'  in  the  sub-interval  [t'-k',t')  = 
[t+k,t')  is 

k  k' 

"^  TI^  'l'^^-(t))  +  cc^^^tE!—  '^J!(m,,(t+k))  .        (15) 

By  comparison,  the  present  discounted  cost  of  the  feasible  alternative  policy 
which  would  reverse  the  order  of  exploitation  of  pools  i  and  i'  in  the 
interval  [t,t')  but  would  otherwise  leave  the  system  in  the  same  state 
at  times  t  and  t '  is 

k'  k 

t  1-a    „,k' ,    ,  .,,.  ,   t+k'  1-a   „,k,   ,  ^^        ^-.^x 
a  Y^ —  f^,(m^,(t+k))  +  a     j^     'l'^(m^(t))  .       (16) 
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Because  the  original  policy  was  optimal,  the  value  of  expression 
(15)  must  not  be  higher  than  (16).   The  resulting  inequality  reduces  to 

>i'J(m^(t))  <  ?J,'(m.,(t+k))  ,  (17) 

which  is  equivalent  to  (14)  for  t+k  =  t'-k'. 

Suppose  now  that  t+k  <  t'-k'.   Divide  the  time  interval  [t+k, t'-k') 

into  r  contiguous  sub-intervals  of  the  form  [t.  ,,t.)  with  t.  >  t.  ,.   The 

J-1  J        J     J-1 

divisions  are  made  so  that  during  each  sub-interval  one  distinct  pool  is 

being  exploited.   Let  pool  i.  (i.  ^f   i.  ,  and  i.  4   i.,,)  be  tapped  in  the 

J    J     J-1      J     1+1 

time  interval  [t.  ,,t.)  for  i  =  l,2,''*,r  where  t_  e  t+k,  t   =  t'-k',  i„  =  i, 

J-1     3  J  >    »        >  Q  >      J.  )      Q  J 

i  ^  =  i'.   Repeatedly  applying  the  appropriate  form  of  result  (17)  at  each 

r 
of  the  r+1  transition  times  {t.}     (where  exploitation  of  one  pool  ends 

J  3=0 
and  of  another  begins)  yields  a  string  of  inequalities  which  can  be  collapsed 

into  (14). 

The  basic  idea  is  illustrated  in  Figure  3  for  the  case  r  =  2. 
From  what  has  previous Ij'  been  shown,  the  equivalent  stationary  cost  over 


>-   time 


Figure  3 
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the  interval  [t,t+k)  is  <  over  [t+k,t  ),  which  is  <  over  [t  ,t'-k'),  which  is 
<  over  [t'-k',t').   This  concludes  our  proof  of  the  basic  monotonicity 
property. 

The  following  corollary  to  the  basic  monotonicity  property  is 
important. 

Let  i'  be  any  pool  and  k'  any  positive  integer.   Suppose  there  is 
a  time  T  such  that  pool  i'  is  never  tapped  after  T.   Let  the  equivalent 
stationary  cost  of  (hypothetically)  exploiting  i'  over  the  time  interval 
[T,T+k')  be 

y  =   i-Jicm^.a))  .  (18) 

Suppose  pool  i  is  is  exploited  throughout  the  interval  [T,T+k)  but 

8/ 
not  in  period  T+k,  for  some  positive  —  integer  k.   Let  the  equivalent 

stationary  cost  of  exploiting  i  over  the  time  interval  [T,T+k)  be 

A  E  f^(m.(T))  .  (19) 

In  an  optimal  policy,  it  must  be  that 

A  <  u  .  (20) 

The  proof  of  corollary  (20)  is  as  follows.   Let  the  equivalent 
stationary  cost  at  time  T  of  the  optimal  policy  over  the  entire  interval  [T,oo), 
during  which  there  is  no  extraction  from  i' ,  be  3.   In  other  words,  if  the 
allegedly  optimal  policy  costs  f(0)  in  period  0, 


8/ 

—  We  also  allow  the  possibility  k=°°.   In  this  case  it  will  be  evident  that 

the  proof  of  the  corollary  still  goes  through,  although  there  is  no  opportunity 

or  need  to  apply  the  basic  monotonicity  property.   If  k'=°°,  the  statement  and 

proof  of  the  corollary  are  likewise  unaffected. 
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OO 

6  E  (l-a)   E   f(e)  a®"^  .  (21) 

e=T 

By  the  basic  monotonicity  property,  A  must  be  no  greater  than  the 
equivalent  stationary  cost  over  any  sub-interval  of  [T,°°)  during  which  one 
distinct  pool  is  being  exploited.   But  there  is  an  easy  generalization  of 
(12)  which  says  that  the  equivalent  stationary  cost  of  extracting  over  any 
interval  is  a  weighted  average  of  the  equivalent  stationary  costs  of  ex- 
tracting over  each  of  its  component  sub- intervals.   It  follows  that 

A  <  B  .  (22) 

Consider  an  alternative  policy  which  exploits  pool  i'  in  the 
interval  [T,T+k')  while  postponing  to  a  starting  date  of  T+k'  the  allegedly 
optimal  policy  which  had  previously  begun  at  T.   In  other  words,  what  was 
policy  over  [T,°°)  now  becomes  policy  over  [T+k'  ,°°)  and  pool  i'  is  exploited 
in  the  interval  [T,T+k').   Before  time  T,  the  original  and  alternative 
strategies  coincide. 

The  cost  over  [T,"")  of  the  alternative  variation  discounted  to 
time  T  is 

,   k'       k' 

l-g      ,  a 

l-a        l-a 

This  must  be  no  less  than  the  corresponding  discounted  cost  of  the  original 
optimal  policy 


l-a 
It  follows  that 


<  M  .  (23) 
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Combining  (22)  and  (23)  yields  the  desired  corollary  (20). 

Proof  of  Optimality 

Our  proof  of  the  main  result  is  by  contradiction.   Suppose  that 
the  proposed  decision  rule  (10), (11)  is  non-optimal.   If  some  other  policy 
is  optimal,  there  must  be  a  time  t  and  a  pool  j  such  that 


$,(m.(t))  >  $,.(m..(t))  (24) 


and  yet 


Vj(t)  =  1  ,  (25) 

instead  of  v.^(t)  =  1. 

In  this  allegedly  optimal  policy,  let  t+k  be  the  first  period  after 
t  that  some  pool  other  than  j  is  exploited.   The  equivalent  stationary  cost 
of  exploiting  pool  j  over  the  interval  [t,t+k)  is 

E   F.(m.(t+h))  a" 
_  h=0  ^   ^ ,„,. 

^  = k^TT •  ^^^^ 

h=0 

From  the  definition  of  $.(m.(t)), 

$  (m  (t))  <  Y  , 

which  can  be  combined  with  (24)  to  yield 

$.^(m.^(t))  <  Y  .  (27) 

Let  n*  be  the  value  of  n  which  satisfies  (8)  for  i=i*,  m=m   (t). 
If  n*=°°,  a  direct  argument  based  on  (27)  and  the  monotonicity  property  shows 
that  exploiting  only  pool  i*  from  time  t  on  has  a  present  discounted  cost 
lower  than  the  proposed  policy.   In  what  follows,  the  case  n*  <  °°  is  treated. 
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Although  pool  j  is  being  exploited  at  time  t  instead  of  pool  i*, 
it  may  be  that  i*  will  be  exploited  again  at  some  future  dates.   Suppose, 
as  one  case,  that  at  least  m   (t)  +  n*  units  are  eventually  extracted  from 
pool  i  in  the  allegedly  optimal  policy  with  (24) , (25)  holding.   Let  t'  be 
the  first  time  when 

m^^(t')  =  m^^(t)  +  n*  . 

Let  k'  be  the  positive  integer  such  that  pool  i*  is  exploited  throughout 
the  interval  [t'-k',t'),  but  not  at  time  t'-k'-l. 


becomes 


For  the  values  i=i*,  m=m   (t) ,  n=n*,  h=n*-k' ,  expression  (13) 

1* 


^^l(in.^(t)+n*-k')  <  0.^(m^^(t))  .        (28) 
Combining  (28)  with  (2  7)  yields 

4'^^(m^^(t)+n*-k')  <  Y  •  (29) 

Now  inequality  (29)  is  a  direct  contradiction  with  the  basic 
monotonicity  property  (14).   A  time  segment  [t,t+k)  with  higher  equivalent 
stationary  cost  y  is  preceeding  a  segment  [t'-k',t')  with  lower  equivalent 
stationary  cost  T.^(m.^(t)+n*-k') . 

A  contradiction  (now  with  the  corollary)  is  also  implied  if  after 
time  t  a  total  of  less  than  n*  additional  units  are  to  be  taken  from  pool 
i*  in  an  allegedly  optimal  policy.   Let  T  be  whichever  is  larger  of  t 
and  the  time  right  after  the  last  unit  has  been  taken  from  pool  i*  (considered 
to  be  zero  if  i*  is  never  tapped).   In  the  present  context  let  k'  be  defined 
as  the  positive  integer  satisfying 


m.^(T)  =  lira  m.^(s)  =  m.^(t)  +  n*  -  k'  .      (30) 
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For  k'  so  defined,  (29)  continues  to  hold.   Using  (30),  (29) 
becomes 

Y^^(m.^(T))  <  Y  .  (31) 

Employing  definition  (18)  for  i'  =    ±* ,    (20)  becomes 

By  the  basic  monotonicity  property  and  the  definition  of  T, 

y  <   X    .  (33) 

Inequalities  (32)  and  (33)  can  be  combined  to  yield 

>^iI(in.^(T))  >  Y 

which  is  a  direct  contradiction  with  (31). 

This  concludes  our  proof  of  the  form  of  an  optimal  policy. 

Scarcity  Price  of  the  Resource 

By  'efficiency',  'shadow',  or  'scarcity'  price  of  the  resource  at 
time  t  is  meant  its  marginal  value  at  that  time,  defined  in  terms  of  the  overall 
objective  (1),  discounted  to  t.   It  turns  out  that  the  efficiency  price  of 
the  generalized  resource  can  be  represented  by  a  simple  expression  with  a 
direct  economic  interpretation. 

The  shadow  price  of  the  resource  at  time  t  is 


p^  =  (1-a)   E   F*(s)  a^  ^  ,  (34) 

s=t 


where 


F*(«)  -=   Fi*(s)K*(^))  (35) 
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is  the  extraction  cost  in  period  s  of  an  optimal  policy. 

The  above  result  has  a  most  interesting  interpretation.   The 
current  shadow  price  is  an  exponentially  weighted  (by  the  discount  rate) 
average  of  all  future  costs.   At  any  time  the  scarcity  price  of  an  exhaustible 
resource  is  the  equivalent  stationary  cost  of  exploitation  henceforth.   In 
other  words,  the  efficiency  price  is  exactly  equal  to  that  hypothetical  constant 
extraction  cost  per  unit  which  would  yield  the  same  present  discounted 
cost  as  an  optimal  policy. 

That  the  shadow  price  must  have  the  form  (34)  is  easily  demonstrated. 
Suppose  that  an  extra  unit  of  the  resource  is  given  as  a  free  gift  in  period  t 
(recall  that  the  resource  could  have  been  measured  in  arbitrarily  small  units 
to  begin  with).   Consider  the  following  proposed  policy,  which  is  now 
feasible.   Up  to  time  t,  follow  exactly  the  same  extraction  pattern  as  the 
original  optimal  solution  (10) , (11) ,   During  the  time  interval  [t,t+l), 
take  advantage  of  the  free  gift  and  extract  nothing.   Throughout  the  interval 
[s,s+l)  for  s  >  t+1,  extract  from  the  pool  i*(s-l)  which  in  the  original 
optimal  solution  had  been  exploited  during  the  previous  interval  [s-l,s). 

The  cost  of  such  a  policy  discounted  to  period  t  is 

C'  =  T.     F*(s)  a^   +  E  F*(s-1)  a^    . 
^        s=0  t+1 

The  cost  of  the  original  optimal  policy  discounted  to  period  t  is 


C  =   E   F*(s)  a^  ^ 
^   s=0 


The  difference  is 


C   -  C  =  (1-a)   E   F*(s)  a^  ^ 
t    t 

s=t 
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Thus ,  the  gain  achieved  by  the  gift  of  an  extra  unit  of  the  resource 

9/ 
in  period  t  is  at  least  —  p  ,  defined  by  (34).   Using  a  similar  argument, 

the  loss  incurred  by  taking  away  one  unit  of  the  resource  in  period  t-1  is 

no  greater  than  p  .   The  efficiency  price  of  the  resource  at  time  t,  defined 

as  its  undiscounted  marginal  value,  —  is  therefore  given  by  expression  (34). 

As  a  consequence  of  (34) ,  the  efficiency  price  of  a  depletable 

resource  can  have  a  complicated  and  interesting  time  profile.   It  may  temporarily 

decline,  corresponding  to  the  development  of  a  source  having  a  significant 

range  of  decreasing  costs  (the  decline  will  be  permanent  if  the  decreasing 

cost  stage  lasts  forever).   In  the  long  run,  when  different  pools  are 

exploited  the  shadow  price  trends  upward  because  the  resource  is  becoming 

more  expensive  as  the  cheaper  sources  are  exhausted;  this  increase  will  of 

course  be  monotonic  in  the  classical  case  where  each  pool  has  nondecreasing 

costs.   For  operations-research  type  problems  with  renewable  options,  the 

shadow  price  exhibits  a  wave  like  recursive  pattern. 


9/ 

—  It  would  be  exactly  p   if  the  gift  came  as  an  unforseen  "surprise"  at  t, 

in  which  case  the  proposed  policy  would  be  optimal.   If  the  date  of  the  gift 
is  known  beforehand,  a  planner  may  be  able  to  cut  present  discounted  costs 
by  altering  the  proposed  extraction  schedule. 

—  A  few  mathematical  fine  points  are  being  glossed  over  in  favor  of  a  cleaner 
exposition.   For  example,  left  and  right  hand  side  marginal  products  need  not 
be  equal  to  each  other;  the  proposed  shadow  price  is  merely  "trapped  between" 
them.   Also,  it  is  implicitly  being  assumed  that  time  periods  are  "sufficiently 
small"  to  justify  the  variational  arguments  being  employed. 
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